WPI-CS-TR-00-16, May 2000: Scalable Maintenance in Distributed Data

Authors

  • Lingli Ding
  • Xin Zhang
  • Elke A. Rundensteiner
Abstract

The maintenance of data warehouses is becoming an increasingly important topic due to the growing use, derivation and integration of digital information. Most previous work has dealt with only one centralized data warehouse (DW). In this paper, we focus on environments with multiple data warehouses, some of which may be derived from other data warehouses. In such a large-scale environment, data updates from base sources may arrive at individual data warehouses in different orders, resulting in inconsistent data warehouse extents. We propose a registry-based solution strategy that addresses this problem by employing a registry agent responsible for establishing one unique order for the propagation of updates from the base sources to the data warehouses. With this solution, individual data warehouse managers can still maintain their respective extents autonomously and independently of each other, which allows them to apply any of the existing incremental maintenance algorithms from the literature. We demonstrate that this achieves consistency across all data warehouses. To make this approach scalable, we further optimize the registry solution by partitioning the set of data warehouse managers into smaller data warehouse groups, each equipped with its own registry. We present a partitioning algorithm for generating such a scalable DW group architecture. Finally, we analyze the performance of the proposed solutions based on a cost model, and demonstrate that the partitioned registry approach achieves substantially better performance than the unpartitioned registry solution.
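
The ordering idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract gives no protocol details, so the names `Registry`, `Warehouse`, `register` and `receive`, the use of integer sequence numbers, and the buffering strategy are all assumptions made for illustration. Each warehouse here simply records updates instead of running a real incremental maintenance algorithm.

```python
import heapq
from dataclasses import dataclass, field
from itertools import count


@dataclass
class Registry:
    """Hypothetical registry agent: stamps every base-source update
    with a globally unique, increasing sequence number."""
    _next: count = field(default_factory=count)

    def register(self, source, payload):
        return (next(self._next), source, payload)


@dataclass
class Warehouse:
    """Hypothetical DW manager: buffers out-of-order arrivals and
    applies updates strictly in registry order."""
    expected: int = 0
    pending: list = field(default_factory=list)
    applied: list = field(default_factory=list)

    def receive(self, stamped):
        heapq.heappush(self.pending, stamped)
        # Apply every buffered update whose turn has come.
        while self.pending and self.pending[0][0] == self.expected:
            _, source, payload = heapq.heappop(self.pending)
            self.applied.append((source, payload))  # stand-in for maintenance
            self.expected += 1


registry = Registry()
u0 = registry.register("IS1", "insert r1")
u1 = registry.register("IS2", "delete r2")

dw_a, dw_b = Warehouse(), Warehouse()
for update in (u0, u1):   # arrives in registry order
    dw_a.receive(update)
for update in (u1, u0):   # arrives in the opposite order
    dw_b.receive(update)

print(dw_a.applied == dw_b.applied)
```

Both warehouses end up applying the updates in the same order even though they received them in different orders, which is the consistency property the abstract attributes to the registry. Under the same assumptions, the partitioned variant would give each group of warehouses its own `Registry` instance.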

Similar resources

A Framework for Visualizing the Behavior of Component-Based Software Systems

Matt Ward and George Heineman, Computer Science Department, WPI, Worcester, MA 01609. WPI-CS-TR-01-19. {matt,heineman}@cs.wpi.edu

WPI-CS-TR-98-8, August 1998: Data Warehouse Maintenance Under Concurrent Schema

Data warehouses (DW) are built by gathering information from several information sources (ISs) and integrating it into one repository customized to users' needs. Recently proposed view maintenance algorithms tackle the problem of (concurrent) data updates happening at different autonomous ISs, whereas the EVE system addresses the maintenance of a data warehouse after schema changes of ISs. The concurr...


Publication date: 2000